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In a generic switching problem, a switching pattern consists of a one-to-one mapping from a set of inputs to a 
set of outputs (i.e., a permutation). We propose and investigate a wireless switching framework in which a multi- 
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antenna relay is responsible for switching traffic among a set of N stations. We refer to such a relay as a MIMO 
switch. With beamforming and linear detection, the MIMO switch controls which stations are connected to which 
stations. Each beamforming matrix realizes a permutation pattern among the stations. We refer to the corresponding 
permutation matrix as a switch matrix. By scheduling a set of different switch matrices, full connectivity among 
the stations can be established. In this paper, we focus on "fair switching" in which equal amounts of traffic are to 
be delivered for all N(N — 1) ordered pairs of stations. In particular, we investigate how the system throughput can 
be maximized. In general, for large N the number of possible switch matrices (i.e., permutations) is huge, making 
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the scheduling problem combinatorially challenging. We show that for TV = 4 and 5, only a subset of N — 1 switch 

CO 

matrices need to be considered in the scheduling problem to achieve good throughput. We conjecture that this 
will be the case for large N as well. This conjecture, if valid, implies that for practical purposes, fair-switching 
scheduling is not an intractable problem. 
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I. Introduction 

Relaying in wireless networks plays a key role in various communication applications H). The use 
of relays can extend coverage as well as improve energy efficiency 0. In this paper, we study a set- 
up in which N stations communicate with each other via a multi-antenna relay. With beamforming, the 
relay controls which stations are connected to which other stations. Each beamforming matrix realizes a 
permutation among the stations represented by a switch matrix. By scheduling a set of different switch 
matrices, full connectivity among the stations can be established. 
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Prior work that investigated N stations exchanging data via a relay includes [0, [J3J , flU, and (5). Ref. 
[|2l studied "pairwise data exchange", in which stations form pairs, and two stations in a pair exchange data 
with each other only. Specifically for pairwise data exchange, if station i transmits to station j, then station 
j transmits to station i as well. In [2J, MIMO relays with different forwarding strategies were considered. 
Ref. also studied pairwise data exchange, but the relay adopts the decode-and-forward strategy only. 
The diversity-multiplexing tradeoffs under reciprocal and non-reciprocal channels were analyzed. Both fl2) 
and (31 studied the case in which a station communicates with one other station only. In a general setting, 
a station could have data for more than one station. In this paper, we focus on a uniform traffic setting 
in which the amounts of traffic from station i to station j are the same for all i,j £ {1, • • • , N}, i ^ j. 
"Fair switching" is used to meet the uniform traffic requirement. Specifically, fair switching is realized 
by scheduling a set of switch matrices. To the best of our knowledge, the framework of fair switching 
has not been considered in the existing literature. 

Refs. [4J and (H investigated the case of full data exchange, in which all stations want to broadcast 
their data to all other stations!]. Data transmissions in flU and B21 can be summarized as follows: in the 
first slot, all stations transmit to the relay simultaneously; the first slot is followed by multiple slots for 
downlink transmissions; in each downlink slot, the relay multiplies the signal received in the first time 
slot by a different beamforming matrix, such that at the end of all downlink slots, all stations receive 
the broadcast data from all other stations. By contrast, the framework investigated in this paper is more 
general in that it can accommodate the pure unicast case, the mixed unicast-multicast case, as well as the 
pure broadcast case as in [@] and flU. In particular, a station i can have data streams, and each station 
j 7^ i is a target receiver of one of the Mj streams. 

In our framework, the MIMO relay serves as a general switch that switches traffic among the stations. 
We use beamforming at the relay and linear detection to realize different connectivity patterns among 
the stations. Each beamforming matrix realizes a permutation among the stations represented by a switch 
matrix. By scheduling a set of switch matrices, the MIMO switching system can realize any general 
transmission pattern (unicast, multicast, broadcast, or a mixture of them) among the stations. 

Before delving into technical details, we provide a simple example to illustrate the scenario of interest 
to us here. Consider a network with three stations, 1, 2, and 3. The traffic flows among them are shown 
in Fig. Q] station 1 wants to transmit "a" to both stations 2 and 3; station 2 wants to transmit "6" and "c" 

'Note that full data exchange is also discussed in (Zj). But they consider a single-antenna relay. 
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to stations 1 and 3, respectively; station 3 wants to transmit "gT and "e" to stations 1 and 2, respectively. 
Pairwise data exchange as in [|2l and is not effective in this case because when the number of stations 
is odd, one station will always be left out when forming pairs. That is, when the number of stations is 
odd, the connectivity pattern realized by a switch/permutation matrix does not correspond to pairwise 
communication. Full data exchange is not appropriate either, since in our example, station 2 (as well as 
station 3) transmits different data to the other two stations. Under our framework, the traffic flows among 
stations can be met as shown in Fig. |2] In the first slot, station 1 transmits "a" to station 3; station 2 
transmits "6" to station 1; station 3 transmits "e" to station 2. In the second slot, station 1 transmits "a" 
to station 2; station 2 transmits "c" to station 3; station 3 transmits "d" to station 1. In Section III.C, we 
will present the details on how to realize the switch matrices. To limit the scope, this paper focuses on the 
use of amplify-and-forward relaying and zero forcing (ZF) to establish the permutations among stations. 

The rest of the paper is organized as follows: Section II describes the framework of wireless MIMO 
switching and introduces the ZF relaying method for establishing permutations among stations. The fair 
switching framework is presented in Section III. Section IV discusses our simulation results. Section V 
concludes this paper. 

II. System Description 

A. System Model 

Consider N stations, Si, • • • , Sn, each with one antenna, as shown in Fig. |3j The stations communicate 
via a relay R with N antennas and there is no direct link between any two stations. Each time slot is 
divided into two subslots. The first subslot is for uplink transmissions from the stations to the relay; the 
second subslot is for downlink transmissions from the relay to the stations. We assume the two subslots 
are of equal duration. Each time slot realizes a switching permutation, as described below. 

Consider one time slot. Let x — {x±, ■ • ■ ,xn} t be the vector representing the signals transmitted by the 
stations. We assume all stations use the same transmit power, normalized to one. Thus, E{af } = 1, V i. 
We also assume that E{xj} = 0, V i, and that there is no cooperative coding among the stations so 
that E{xj£j} = 0, V i ^ j. Let y = {yi,--- ,Vn} t be the received signals at the relay, and u = 
{u\, ■ ■ ■ ,un} t be the noise vector with i.i.d. noise samples following the complex Gaussian distribution, 
i.e., u n ~ A/" c (0, of). Then 

y = H u x + u, (1) 
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where H u is the uplink channel gain matrix. The relay multiplies y by a beamforming matrix G before 
relaying the signals. We impose a power constraint on the signals transmitted by the relay so that 

E{\\Gy\\ 2 }<p. (2) 

Combining (OQ) and ©, we have 

Tr[H%G H GH u ] + Tr[G H G]a 2 < p. (3) 

Let H d be the downlink channel matrix. Then, the received signals at the stations in vector form are 

r = H d Gy + w = 11 Gil r + H d Gu + w. (4) 

where w is the noise vector at the receiver, with the i.i.d. noise samples following the complex Gaussian 
distribution, i.e., w n ~ Af c (0, a 2 ). 



B. M1MO Switching 

Suppose that the purposes of G are to realize a particular permutation represented by the permutation 
matrix P, and to amplify the signals coming from the stations. That is, 

H d GH u = AP, (5) 

where A = diagjai, • • • ,ajv} is an "amplification" diagonal matrix. Define r = A _1 r, i.e., station Sj 
divides its received signal by aj. We can rewrite © as 

r= [x h ,--- ,x is ,--- ,x iN ] T + PH~ 1 u + A^w, (6) 

where is the station transmitting to Sj under the permutation P (i.e., in row j of P, element ij is 
one, and all other elements are zero). Suppose that we require the received signal-to-noise ratio (SNR) of 
each station to be the same. Let hr~?/.i be element (i, j) in H~\ Then 

k 1 j ] 

Note that a\ is the effective noise power for each station under unit signal power. 
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Substituting © into ©, we have 

i = E \&m 2 + E i E *&$><**UV £ p. <*> 

where h^~^ is element in -tf^ 1 . Let cij = \aj\e %dj , then combining © and ([8]) gives 

s = £ — ^mL^ + a r 2 V|V ^"-fe^ e f< p , (9) 

Problem Definition 1: Given H u ,Hd,p,a 2 ,a 2 , and a desired permutation P, solve for minimum a 2 
and the corresponding G. 

Random-phase Algorithm: For a given set of 9j, j = 1, • •• ,N, according to ©, lim q = 0, and 
lim g = +oo. Furthermore, q is a continuous function of <r e . Thus, there exists a a e 

such that q = p. Denote such a a e by <7 e (#i, • • • , 0jv). The problem consists of finding 

cr* = arg min cr e (0i, • • • ,9 N ). (10) 

01,— ,6jv 

We note that a e is a complicated nonlinear function of 9j. A time-consuming exhaustive search can be 
used to find the solution to (TTOl) . We use a random-phase algorithm to reduce the complexity. We divide 
the interval of [0, 2%) equally into M bins with the values of 0, ||, • • • , 2 ^ JV ^ 1 - >7r respectively and randomly 
pick among them to set the the value of 9j for each and every j = 1, ■ ■ ■ , N. After that, we compute the 
corresponding a e (9±, ■ ■ ■ ,9^) by solving © with the inequality set to equality. We perform L trials of 
these random phase assignments to obtain L different values of a e . We choose the smallest among them 
as our estimate for a*. Substituting the estimated a* into © yields \a,j\ for all j; hence G. Note that the 
solution found is a feasible solution and is in general larger than the actual optimal a*. In Section IV, 
we will show that large gains can be achieved with only small M and L. Moreover, increasing M and 
L further yields very little improvement, suggesting that the estimated a* with small M and L is near 
optimal. 

III. Fair Switching 

As has been described in the previous section, in each time slot, the stations transmit to one another 
according to a switch matrix. In this section, we study the fair switching scenario in which each station 
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has an equal amount of traffic for every other station. The data from station i to station j could be different 
for different j, so this is not restricted to the multicast or broadcast setting. To achieve fair switching, 
multiple transmissions using a succession of different switch matrices over different time slots will be 
needed. We next discuss the set of switch matrices. 

A. Derangement 

We assume a station does not transmit traffic to itself. A derangement is a permutation in which i is 
not mapped to itself [6J. While the number of distinct permutations with N stations is N\, the number 
of derangements is given by the recursive formula 

d N = N-d N -x + (-l) N , (11) 

where d\ = 0. For example, d A = 9 although the number of permutations is 4! = 24. It can be shown 
that linijv^oo 7^ = e" 1 and the limit is approached quite quickly. Thus, the number of derangements is in 
general very large for large N. Performing optimization over this large combinatorial set of derangements 
in our problem is a formidable task. For example, in our fair switching problem, we want to maximize 
the system throughput by scheduling over a subset of derangements. It would be nice if for our problem, 
the optimal solution is not very sensitive to the particular selection of derangements. In Part B, we will 
formalize the concept of "condensed derangement sets". 

B. Condensed Derangement Set 

Definition 1: A set of iV — 1 derangements, Di, D 2 , ■ ■ ■ , Dn-i, is sa id to be a condensed derangement 
set if 

AT-l 

J2Dn = J-I, (12) 

n=l 

where J is a matrix with all "1" elements, and I is the identity matrix. 

The four condensed derangement sets for N = 4 are Qi = {Pi, P$, P$}, Q2 = {Pi,Pe,Ps}, 
Q 3 = {P 2 , Pi, P 9 }, and Q 4 = {P 3 , P 5 , P 7 }, where P n are listed in TABLE I. There are d 5 = 44 
derangements for N = 5 and the number of condensed derangement sets is 56. 

In fair switching, we want to switch an equal amount of traffic from any station i to any station j, 
i ^ j. This can be achieved by scheduling the derangements in the condensed derangement set in a 
weighted round-robin manner (as detailed in "Approach to Problem 2" below). Given a condensed set, the 



scheduling to achieve fair switching is rather simple. However, different condensed sets could potentially 
yield solutions of different performance. And the number of condensed derangement sets is huge for large 
N. We define a problem as follows. 

Problem Definition 2: Suppose that we want to send equal amounts of traffic from Si to Sj V i ^ j. 
Which condensed derangement sets should be used to schedule transmissions? Does it matter? 

Approach to Problem 2: The derangements in a condensed derangement set are the building blocks for 
scheduling. For example, in a complete round of transmissions, we may schedule derangement D n for 
k n time slots. Then the length of the complete round transmissions will be J2n=i ^n- 

Consider the case of N = 4. There are four condensed derangement sets. The question is which 
condensed derangement set will result in the highest throughput. We could approach the problem as 
follows. 

Let Q m = {D™, D™, ■ ■ ■ , be a particular condensed derangement set. For each D™ , we use 

random-phase algorithm above to compute the corresponding a 2 e , denoted by o\ nm . The Shannon rate is 
then 

r n , m = log(l + — — ). (13) 

(7 

e,n,m 

Because of the uniform traffic assumption, we require k n ^ m r n ^ m = c, V n E [1, • • • , N — l], for some c. That 
is, c is the amount of traffic delivered from one station to another station in one round of transmissions. 
The effective throughput per station (i.e., the amount of traffic from a station to all other stations) is 



(iV-l)c _ N-l 
V^ 1 k ~ V s " 1 1 Tr 

Z^n=l ""n,m Z^n=l 



Numerically, we could first solve for r n m V n. Then, we apply (fl4l) to find the throughput. 

The question we want to answer is whether T m for different Q m are significantly different. For the 
case of N = 4 and 5, we will show simulation results indicating that the throughputs of different Q m are 
rather close, and therefore it does not matter which Q m we use. 



C. Generalization 

As mentioned in the introduction, most prior works for multi-way relay networks focus on two patterns 
of transmissions. The first is pairwise unicast, in which stations form pairs, and the two stations of a pair 
only communicate with each other [|2), [|3). The second is the full data exchange, in which each station 
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needs to broadcast to all the other stations [@], [|5). In practice, however, the actual transmission patterns 
could be different from these two patterns. For example, for video conferencing, a subset of stations 
within the network forms a multicast group, and the transmission pattern is somewhere between the two 
extremes above. 

More generally, in the same network, there could be the co-existence of broadcast sessions, multicast 
sessions, pairwise unicast sessions, and unidirectional unicast sessions. The MIMO switching framework 
here is flexible and encompasses this generality. For easy explanation, our previous discussion in Part B 
has an implicit assumption (focus) that each station i wants to send different data to different stations 
j 7^ i. If we examine the scheme carefully, this assumption is not necessary. In the scheme, a station will 
have chances to transmit to all other stations. In particular, a station i will have chances to transmit data 
to two different stations j and k in two different derangements. If so desired, station i could transmit the 
same data to stations j and k in the two derangements. This observation implies that the general traffic 
pattern can be realized. 

For illustration, let us examine how the traffic pattern of Fig. Q] can be realized. This example is a pattern 
consisting of the co-existence of unicast and broadcast. As has been described, the data transmission can 
be realized by scheduling a condensed derangement set, which is Di = [e 3 , ei, e 2 ] and D 2 = [e 2 , e 3 , ei], 
and e n contains 1 in the nth position and zeros elsewhere. The transmitted data of station 1, 2 and 3 are 
respectively [a, 6, e] T for D\ and [a,c,d] T for D 2 . 

IV. Simulation 

In this section, we evaluate the throughputs achieved by different condensed sets. We assume that the 
uplink channel H u and downlink channel H d are reciprocal, i.e., H d = H T U , and they both follow the 
complex Gaussian distribution jV c (0, J). We assume the relay has the same transmit power as all the 
stations, i.e., p — 1. 

We will answer the question raised in Problem Definition 2. We analyze the scenarios where N = 4 
and N — 5. The four different condensed derangement sets of N = 4, Qi, Q 2 , Q3 and Q4, are considered 
for fair switching. For each channel realization, we evaluate the throughput per station T m as defined by 
(fT4l) . We simulated a total of 10000 channel realizations and computed HL{T m } averaged over the channel 
realizations. Recall that for random-phase algorithm, there are two associated parameters: number of trials 
L and number of bins M (see Section II). We find that for a fixed L and a fixed M, the four condensed 
derangement sets yield essentially the same average throughput (within 1% in the medium and high SNR 
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regimes and within 2% in the low SNR regime). Fig. |4] plots the throughput for one of the condensed 
derangement set for different L and M. For N = 5 there are 56 different condensed derangement sets. 
As with the N = 4 case, all the sets have roughly the same average throughput (within 1%). Fig. [5] also 
plots the results of one set. We also note that increasing L and M beyond 10 and 8 respectively yields 
little throughput gain. This implies that our heuristic yields near optimal result when L = 10 and M = 8. 

We conjecture that different condensed derangement sets achieve roughly the same average throughput 
for N larger than 5 as well. A concrete proof remains an open problem. The ramification of this result, 
if valid, is as follows. For large N, the number of condensed derangement set is huge, and choosing the 
optimal set is a complex combinatorial problem. However, if their relative performances do not differ 
much, choosing any one of them in our engineering design will do, significantly simplifying the problem. 

A scheme proposed in investigates a similar problem as ours. It simply uses a positive scalar weight 
to control the relay power consumption instead of our diagonal A. As a comparison, we also plot the 
throughputs of the scalar scheme in [JVj . Our scheme with diagonal A outperforms the scalar scheme 
by ldB and 0.5dB in Fig. |4] and Fig. [5J Besides the advantage in throughput, our scheme has another 
advantage over the scalar scheme in that our scheme guarantees fairness. That is, in our basic scheme, 
each station has exactly the same throughput, while the stations in the scheme in Q could have varying 
throughputs. The scalar scheme in []Vj focuses on optimizing the sum rate of all stations; the individual 
rates of the stations may vary widely with only one degree of freedom given by the scalar. 

To sum up this section, we state the following general result: 

General Result: In our framework of MIMO fair switching with 4 or 5 stations, any condensed 
derangement set can be used because different condensed derangement sets achieve roughly the same 
average throughput. We conjecture that this will be the case when the number of stations is large as well. 
If this conjecture holds, then the issue of condensed set selection will go away, and the complexity of the 
optimization problem will be greatly reduced. This conjecture remains to be proven. 

V. Conclusions 

We have proposed a framework for wireless MIMO switching to facilitate communications among 
multiple wireless stations. A salient feature of our MIMO switching framework is that it can cater to 
general traffic patterns consisting of a mixture of unicast traffic, multicast traffic, and broadcast traffic 
flows among the stations. 
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There are many nuances and implementation variations arising out of our MIMO switching framework. 
In this paper, we have only studied the "fair switching" setting in which each station wants to send equal 
amounts of traffic to all other stations. In this setting, we aim to deliver the same amount of data from 
each station i to each station j ^ i by scheduling a set of switch matrices. In general, many sets of 
switch matrices could be used for such scheduling. The problem of finding the set that yields optimal 
throughput is a very challenging problem combinatorially. Fortunately, for number of stations iV = 4 or 5, 
our simulation results indicate that different sets of switch matrices achieve roughly the same throughput, 
essentially rendering the selection of the optimal set a non-issue. We conjecture this will be the case 
for larger iV as well. If this conjecture holds, then the complexity of the optimization problem can be 
decreased significantly as far as engineering design is concerned. 

There are many future directions going forward. For example, the beamforming matrices used in our 
simulation studies could be further optimized. Physical-layer network coding could be considered to 
improve throughput performance [|S1. In addition, the setting in which there are unequal amounts of traffic 
between stations will be interesting to explore. Also, this paper has only considered switch matrices that 
realize full permutations in which all stations participate in transmission and reception in each slot; it 
will be interesting to explore switch matrices that realize connectivities among stations that are not full 
permutations. Finally, future work could also explore the case where the number of antenna at the relay 
is not exactly N. 
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TABLE I 

Derangements of N = 4. 
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Fig. 1. Traffic demand of a three stations example. 
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Fig. 2. A transmission established by two slots of unicast connectivity realizes the traffic demand in Fig. [TJ 
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Fig. 3. Wireless MIMO switching. 
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diag A 1 trials & 8 bins 
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Fig. 4. Average throughput per station under MIMO fair switching when TV = 4. In each case only the result of one condensed derangement 
set is presented because the results of other derangement sets are within 2% of the results shown here. 
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Fig. 5. Average throughput per station under MIMO fair switching when TV = 5. In each case only the result of one condensed derangement 
set is presented because the results of other derangement sets are within 1% of the results shown here. 



